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INVISIBLE CONTROL OF SELF-ORGANIZING AGENTS 
LEAVING UNKNOWN ENVIRONMENTS* 
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Abstract. In this paper we are concerned with multiscale modeling, control, and simulation 
of self-organizing agents leaving an unknown area under limited visibility, with special emphasis on 
crowds. We first introduce a new microscopic model characterized by an exploration phase and an 
evacuation phase. The main ingredients of the model are an alignment term, accounting for the 
herding effect typical of uncertain behavior, and a random walk, accounting for the need to explore 
the environment under limited visibility. We consider both metrical and topological interactions. 
Moreover, a few special agents, the leaders, not recognized as such by the crowd, are “hidden” in 
the crowd with a special controlled dynamics. Next, relying on a Boltzmann approach, we derive a 
mesoscopic model for a continuum density of followers, coupled with a microscopic description for 
the leaders’ dynamics. Finally, optimal control of the crowd is studied. It is assumed that leaders 
aim at steering the crowd towards the exits so to ease the evacuation and limit clogging effects, and 
locally-optimal behavior of leaders is computed. Numerical simulations show the efficiency of the 
control techniques in both microscopic and mesoscopic settings. We also perform a real experiment 
with people to study the feasibility of such a bottom-up control technique. 

Key words. Pedestrian models, agent-based models, kinetic models, evacuation, herding effect, 
soft control. 
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1. Introduction. Self-organizing systems are not as mysterious as they were 
when researchers began to deal with them. A number of local interaction rules were 
identified, such as repulsion, attraction, alignment, self-propulsion, etc., and the large- 
scale outcome triggered by the combination of them is so far sufficiently understood 
[54] . Researchers have also achieved a certain knowledge regarding the control of self¬ 
organizing systems, namely the methods to dictate a target behavior to the agents 
still keeping the local rules active all the time. 

In this paper we are concerned with multiscale modeling, control and simulations 
of self-organizing agents which leave an unknown area. Agents are not informed about 
the positions of the exits, so that they need to explore the environment first. Agents 
are assumed not to communicate with each other and they cannot share information 
directly. Moreover, they are selfish and do not intend to help the other agents. 

The guiding application is that of a human crowd leaving an unknown environ¬ 
ment under limited visibility. In that case, we claim that people exhibit two opposite 
tendencies: on the one hand, they tend to spread out in order to explore the environ¬ 
ment efficiently, on the other hand they follow the group mates in the hope that they 
have already found a way out (herding effect). Controlling these natural behaviors 
is not easy, especially in emergency or panic situations. Typically, the emergency 
management actors (e.g., police, stewards) adopt a top-down approach: they receive 
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information about the current situation, update the evacuation strategies, and finally 
inform people communicating directives. It is useful to stress here that this approach 
can be very inefficient in large environments (where communications to people are dif¬ 
ficult) and in panic situations (since “instinctive” behavior prevails over the rational 
one). Let us also mention that in some cases people are simply not prone to follow 
authority’s directives because they are opposing it (e.g., in public demonstrations). 

In this paper we explore the possibility of controlling crowds adopting a bottom-up 
approach. Control is obtained by means of special agents (leaders) who are hidden 
in the crowd, and they are not recognized by the mass as special. From the modeling 
point of view, this is translated by the fact that the individuals of the crowd (followers) 
interact in the same way with both the other crowd mates and the leaders. 

Goals. We first introduce a new microscopic (agent-based) model characterized 
by an exploration phase and an evacuation phase. This is achieved by a combination 
of repulsion, alignment, self-propulsion and random (social) force, together with the 
introduction of an exit’s visibility area. We also consider both metrical and topological 
interactions so to avoid unnatural all-to-all interactions. A few leaders, not recognized 
as such by the crowd, are added to the model with a special, controlled dynamics. 

Second, relying on a Boltzmann approach, we derive a mesoscopic model for a 
continuum density of followers, coupled with a microscopic description for the leaders’ 
dynamics. This procedure, based on a binary interaction approximation, shows that, 
in a grazing collision regime, the microscopic-kinetic limit system is the mesoscopic 
description of the original microscopic dynamics. 

Finally, dynamic optimization procedures with short and long-time horizon are 
considered. It is assumed that leaders aim at steering the crowd toward the exits so 
to ease the evacuation and limit clogging effects. The goal is achieved by exploiting 
the herding effect. Locally-optimal behavior of leaders is computed by means of 
two methods: the model predictive control and a modified compass search. Several 
numerical simulations show the efficiency of the optimization methods in both the 
microscopic and mesoscopic settings. 

Beside numerical simulations, we performed a real experiment with people to 
study the feasibility of the proposed bottom-up crowd control technique. 

Relevant literature. This paper falls in several crossing, and often independent, 
lines of research. Concerning pedestrian modeling, virtually any kind of models have 
been investigated so far and several reviews and books are available. For a quick 
introduction, we refer the reader to the reviews 0 @23 and the books [22 HI] • Some 
papers deal specifically with evacuation problems: a very good source of references 
is the paper [T] , where evacuation models both with and without optimal planning 
search are discussed. The paper ;I] itself proposes a cellular automata model coupled 
with a genetic algorithm to find a top-down optimal evacuation plan. Evacuation 
problems were studied by means of lattice models (23(33, social force models )49j . 
cellular automata models !H 133, mesoscopic models |2], and macroscopic models [21]. 
Limited visibility issues were considered in [2HI23U33. A real experiment involving 
people can be found in m- 

The control of self-organizing agents can be achieved in several ways. For example, 
the dynamics of all agents can be controlled by means of a single control variable, 
which remains active at all times (3j- This kind of control is suitable, e.g., to model 
the influence of television on people. Alternatively, every agent can be controlled by an 
independent control variable. This is the most effective but also the most “expensive” 
control technique because of the large number of interactions with the agents that is 
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required m- A more parsimonious control technique, called sparse control, can be 
obtained by penalizing the number of these interventions by means of an L 1 control 
cost, so that the control is active only on few agents at every instant mnn he], if 
the existing behavioral rules of the agents cannot be redesigned, the control of the 
system can be obtained by means of external agents. Here the literature splits in two 
branches: the case of recognizable leaders, characterized by the fact that the external 
agents have an influence on normal agents stronger than the influence exerted by 
group mates (like, e.g., a celebrity) [51 IT51 1521 Wf\ : and the case of invisible leaders, 
which are instead characterized by the fact that the external agents are completely 
anonymous and are perceived as normal group mates [23 EU 3D. 1 . 33] • Alternative 
approaches, based on the control of the surrounding environment rather than of the 
agents themselves are proposed in (23 133 S3- 

Let us focus on the invisible control, also known as soft control. In the seminal 
paper [26] a repulsion-alignment-attraction model is considered, for which the authors 
show that a little percentage of informed agents pointing towards a target is sufficient 
to steer the whole system to it. The papers 03111] deal with a Vicsek-like (pure 
alignment) model with a single leader. A feedback control for the leader is proposed, 
which is able to align the whole group along a desired direction from any initial 
condition. Invisible control was successfully implemented in real experiments involving 
animals, where leaders come in the form of disguised robots; see, e.g., [35] j which deals 
with cockroaches and [TBj, which deals with zebrafishes. 

The complexity reduction of microscopic models when the number of interacting 
agents is large has received deep interest in the last years. The main technique to 
address the curse of dimensionality is to recast the original microscopic dynamics in 
the form of a PDE, by substituting the influence that the entire population has on 
a single agent with an averaged one. Several approaches have been used to derive 
rigorously this procedure, like the BBGKY hierarchy [38], the mean-field limit m 
H3, or the binary interaction approximation 0] 0H]. Furthermore, only recently this 
problem has been investigated for optimal control of multi-agent systems [35. 

In the case of the control by means of external agents, if the number of leaders 
is relatively small in comparison to the crowd of followers, it might be convenient 
to apply the above dimension reduction techniques to the followers’ population only 
HIES- The resulting modeling setting is described by a coupled ODE-PDE sys¬ 
tem, where we control the microscopic dynamics in order to steer the macroscopic 
quantities. Nonetheless, from the computational point of view, one has to face the 
numerical tractability of the optimal control problem, whose solution requires the iter¬ 
ated evaluation of the macroscopic system. Since the underling dynamics are typically 
non-linear and imply the resolution of high-dimensional integro-differential terms, ad 
hoc approaches have to be developed in order to avoid time-consuming strategies. 
One possibility is to use a fixed dynamics for the system of ODEs, assigned by the 
modelers or as feedback control derived by a properly designed functional, see, e.g., 

0123]. 

Paper organization. The paper is organized as follows. In Section [2] we introduce 
the main features of the model, valid at any scale of observation. In Section [3] we 
present in detail the model at the microscopic scale. In Section [4] we derive the 
associated Boltzmann-type equation and the coupled ODE (micro) - PDE (kinetic) 
model by means of the grazing collision limit , following the path laid out in 03- 
In Section [5] we introduce the control and optimization problem and in Section 0 
we show the results of the numerical tests. In Section [7] we report the results of 
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a real experiment with pedestrians aimed at validating the proposed crowd control 
technique. We conclude the paper sketching some research perspectives. 

2. Model guidelines. In this section we describe the model at a general level. 
Hereafter, we divide the population between leaders, which are the controllers and 
behave in some optimal way (to be defined), and followers, which represent the mass 
of agents to be controlled. Followers cannot distinguish between followers and leaders. 

First- vs. second-order model. One can notice that walking people, animals and 
robots are in general able to adjust their velocity almost instantaneously, reducing to 
a negligible duration the acceleration/deceleration phase. For this reason, a second- 
order (inertia-based) framework does not appear to be the most natural setting. How¬ 
ever, if we include in the model the tendency of the agents to move together and to 
align with group mates we implicitly assume that agents can perceive the velocity 
of the others. This makes unavoidable the use of a second-order model, where both 
positions and velocities are state variables. See [231 Sect. 4.2] for a general discussion 
on this point. In our model we adopt a mixed approach, describing leaders by a first- 
order model (since they do not need to align) and followers by a second-order one. 
In the latter case, the small inertia is obtained by means of a fast relaxation towards 
the target velocity. 

Metrical vs. topological interactions. Interaction of one agent with group mates 
is said to be metrical if it involves only mates within a predefined sensory region, 
regardless of the number of individuals which actually fall in it. Interaction is instead 
said topological if it involves a predefined number of group mates regardless their 
distance from the considered agent. Again, in our model we adopt a mixed approach, 
assuming short-range interactions to be metrical and long-range ones to be topological. 

Isotropic vs. anisotropic interactions. Living agents (people, animals) are gen¬ 
erally asymmetric, in the sense that they better perceive stimuli coming from their 
“front”, rather than their “back”. In pedestrian models, e.g., interactions are usu¬ 
ally restricted to the half-space in front of the person, since the human visual field 
is approximately 180°. Nevertheless, humans can easily see all around them simply 
turning the head. In the case we are interested in, people have no idea of the location 
of their target, hence we expect that they often look around to explore the environ¬ 
ment and see the behavior of the others. This is why we prefer to adhere to isotropic 
interactions. 

Let us describe the social forces acting on the agents. 

Leaders. 

• Leaders are subject to an isotropic metrical short-range repulsion force di¬ 
rected against all the others, translating the fact that they want to avoid 
collisions and that a maximal density exists. 

• Leaders are assumed to know the environment and the self-organizing features 
of the crowd. They respond to an optimal force which is the result of an offline 
optimization procedure, defined as to minimizing some cost functional (see 
Section [5| . 

Followers. 

• Similarly to leaders, followers respond to an isotropic metrical short-range 
repulsion force directed against all the others. 

• Followers tend to a desired velocity which corresponds to the velocity the 
would follow if they were alone in the domain. This term takes into account 
the fact that the environment is unknown. Followers describe a random walk 
if the exit is not visible (exploration phase) or a sharp motion toward the 
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exit if the exit is visible (evacuation phase). In addition, we include a self¬ 
propulsion term which translates the tendency to reach a given characteristic 
speed (modulus of the velocity). 

• If the exit is not visible, followers are subject to an isotropic topological 
alignment force with all the others, i.e., they tend to have the same velocity 
of the group mates. Assuming that the agents’ positions are close enough, 
this corresponds to the tendency to go where group mates go (herding effect). 

3. The microscopic model. In this section we introduce the microscopic model 
for followers and leaders. We denote by d the dimension of the space in which the 
motion takes place (typically d = 2), by N F the number of followers and by N h <C N F 
the number of leaders. We also denote by Q = the walking area and by x T € 
the target point. To define the target’s visibility area, we consider the set S, with 
x T G £ C fl, and we assume that the target is completely visible from any point 
belonging to £ and completely invisible from any point belonging to fl\£. 

For every i = 1,...,N F , let (xi(t ),G R 2d denote position and velocity of 
the agents belonging to the population of followers at time t > 0 and, for every 
k = 1 ,...,N h , let ( Uk{t),Wk(t )) € R 2d denote position and velocity of the agents 
among the population of leaders at time t > 0. Let us also define x := (xi ...., xw) 
and y := (j/i,... ,ym)- 

Finally, let us denote by B r (x) the ball of radius r > 0 centered at x € fi and by 
Bjg-(x-,x,y) the minimal ball centered at x encompassing at least A f agents, and by 
J\f* the actual number of agents in %(i: x, y). Note that A f* > AT. 

Remark 3.1. The computation of Bj^[x ;x, y) requires the knowledge of the 
positions of all the agents, since all the distances |— x\, i = 1,..., N F , and \yk — x\, 
k = 1,..., N L must be evaluated in order to find the N closest agents to x. 

The microscopic dynamics described by the two populations is given by the fol¬ 
lowing set of ODEs: for i = 1,..., N F and k = 1,..., N L , 

(3.1) 

! ±i = Vi, 

Vi = A{xi,Vi ) + J2jL i H F (xi,Vi,Xj,Vj\x, y) + J2eLi H L (xi,Vi,ye,w e ;x, y), 

Vk = Wk = Ejli KF (hk, xj) + KL (Vk, ye) + u k . 

We assume that 

• A is a self-propulsion term, given by the relaxation toward a random direction 
or the relaxation toward a unit vector pointing to the target (the choice 
depends on the position), plus a term which translates the tendency to reach 
a given characteristic speed s > 0 (modulus of the velocity), i.e., 

(3.2) 

A(x,v) := 0{x)C z (z - v) + (1 - 0(x))C T ^ - uj + C s (s 2 - |v| 2 )v, 

where 6 : R d —> [0,1] is the characteristic function of f2\E, 6(x) = Xn\s(a:)) 2 
is a d-dimensional random vector with normal distribution A/”(0, cr 2 ), and C z , 
C T , C s are positive constants. 

• The interactions follower-follower and follower-leader coincide and are equal 
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(3-3) 

H F (x, v, x', v'; x, y) := x') + 0(x)^ iy' 

H L (x, v, y, w; x, y) := H F (x , v, y, w; x, y), 


v ) )i 


for given positive constants C F ,C a ,r , and 7 , and 


R lt r(x, x') 


{ „— \x'— x| T x —x 
\x'-x\ 

0 


if x' £ B r (x)\{x}, 
otherwise, 


models a (metrical) repulsive force, while the second term accounts for the 
(topological) alignment force, which vanishes inside E. Note that, once the 
summations over the agents ]Th H F and are done, the alignment term 

models the tendency of the followers to relax toward the average velocity of the 
AT closer agents. With the choice H F = H L the leaders are not recognized by 
the followers as special. This feature opens a wide range of new applications, 
including the control of crowds not prone to follow authority’s directives. 

• The interactions leader-follower and leader-leader reduce to a mere (metrical) 
repulsion, i.e., I\ F = K L = —C f Rq^ t , where Cf > 0 and £ > 0 are in general 
different from C F and 7 , respectively. Note that here the repulsion force is 
interpreted as a velocity field, while for followers it is an acceleration field. 

• Uk : R + —t R dAr is the control variable, to be chosen in a set of admissible 
control functions. 

Remark 3.2. The behaviour of the leaders is entirely encapsulated in the control 
term u but for a short-range repulsion force. The latter should be indeed interpreted 
as a force due to the presence of the others, thereby non controllable. 


4. Formal derivation of a Boltzmann-type equation. As already men¬ 
tioned, our main interest in (3.1 1 lies in the case N L <C N F , that is the population 
of followers exceeds by far the one of leaders. When N F is very large, a microscopic 
description of both populations is no longer a viable option. We thus consider the 
evolution of the distribution of followers at time t > 0 , denoted by f(t, x, v), together 
with the microscopic equations for the leaders (whose number is still small). To this 
end, we denote with m F the total mass of followers, i.e., 


m F (t) 


/ f(t , x, v) dx dv, 
J R 2d 


which we shall eventually require to be equal to N F . We introduce, for symmetry 
reasons, the distribution of leaders g and their total mass 


N L 

(4.1) g(t,x,v) = '52S( yk (t), Wk (t))(x, v ), 

k= 1 



g(t, x , v) dx dv = N L . 


The evolution of / can be then described by a Boltzmann-type dynamics, derived 
from the above instantaneous control formulation, which is obtained by analyzing the 
binary interactions between a follower and another follower and the same follower 
with a leader. The application of standard methods of binary interactions, see j22] , 
shall yield a mesoscopic model for the distribution of followers, to be coupled with 
the previously presented ODE dynamics for leaders. 
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To derive the Boltzmann-type dynamics, we assume that, before interacting, each 
agent has at his disposal the values x and y that he needs in order to perform its 
movement: hence, in a binary interaction between two followers with state parameter 
(x,v) and (x,v), the value of H F (x,v,x,v;x.,y) does not depend on x and y. In the 
case of H F of the form (3.3), this means that the ball _By(x;x, y) and the value of 
J\f* have been already computed before interacting. 

Moreover, since we are considering the distributions / and g of followers and 
leaders, respectively, the vectors x and y are derived from / and g by means of the 
first marginals of / and g , 7Ti / and r x\g 1 respectively, which give the spatial variables 
of those distribution. Hence, since no confusion arises, we write H F (x, v, x, v: n\f, 7 Ti g) 
in place of H F (x,v,x,V',x, y) to stress the dependence of this term on / and g. 

We thus consider two followers with state parameter (x, v) and (x, v) respectively, 
and we describe the evolution of their velocities after the interaction according to 

v* = v + rf [ 0{x)C z i; + S(x, v) + m F H F (x, v, x, v; nif , -Kig )], 
v* = v + rf [0(x)C z £ + S(x, v ) + m F H F {x , v, x, v; 7Ti /, 7 Ti <7)] , 

where rf is the strength of interaction among followers, £ is a random variables whose 
entries are i.i.d. following a normal distribution with mean 0 , variance f 2 (which shall 
be related to the variance cr 2 of the random vector 2 in ( |3.2| in such a way that we 
recover from (4.2) a Fokker-Planck-type dynamics for /), taking values in a set B , 
and S is defined as the deterministic part of the self-propulsion term (3.2 1 , 

(4.3) S(x, v ) = -6{x)C z v + (1 - 0(x))C T 


(4.2) 


x‘ — X 


— V 


+ c s (s 2 -M>. 


We then consider the same follower as before with state parameters (x, v) and a leader 
agent (x,v); in this case the modified velocities satisfy 


(4.4) 


v ** = v + r/ L m L H L (x, v, x, v; 7ir /, 7Ti<?), 
~ 

v = v, 


where rf is the strength of the interaction between followers and leaders. Note that 

(4.4) accounts only the change of the followers’ velocities, since leaders are not evolving 
via binary interactions. 

Since we are interested in studying the problem in the widest possible framework, 
and avoid sticking only to the previous choice of the functions S, H F and H L , in what 
follows we assume that 


(Irrv) the systems \A.2\ and ( |4.4[ ) constitute invertible changes of variables from (v, v) 
to (v*,v*) and from (v,v) to (v**,v*), respectively. 

The time evolution of / is then given by a balance between bilinear gain and loss 
of space and velocity terms according to the two binary interactions (4.2) and (4.4), 
quantitatively described by the following Boltzmann-type equation 

(4.5) d t f + v V x f = A F Q(/, /) + A L Q(/, g), 

where A F and A L stand for the interaction frequencies among followers and between 
followers and leaders, respectively. The interaction integrals Q{f,f) and Q(f,g) are 
defined as 

1 -/(t,x*,u*)/(f,x*,'0*) - f(t,x,v)f(t,x,v) 


Q(f,g)(t) = E 


J F 

y rf{t,x * 


dx dv 


,v**)g(t,x*,v*) — f(t,x,v)g(t,x,v) I dx dv 
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where the couples (a:*,i>*) and (x*,v*) are the pre-interaction states that generates 


(x,v) and (x,v) via (4.2), and J F is the Jacobian of the change of variables given by 
(4.2) (well-defined by (Inv)). Similarly, (a;**,?;**) and (a;*,?;*) are the pre-interaction 
states that generates (x, v) and (x,v) via (4.4), and J L is the Jacobian of the change 


of variables given by (4.4). Moreover, the expected value E is computed with respect 
to £ G B. 

In what follows, for the sake of compactness, we shall omit the time dependency 
of / and g , and hence of Q(f,f) and Q{f,g ) too. 

Remark 4.1. If we would have opted for a description of agents as hard-sphere 


particles, the arising Boltzmann equation (4.51 would be of Enskog type, see m 
The relationship between the hard- and soft-sphere descriptions (i.e., where repulsive 
forces are considered, instead) has been deeply discussed, for instance, in W- In our 
model, the repulsive force i? 7r is not singular at the origin for computational reasons, 
therefore the parameters 7 and r have to be chosen properly to avoid arbitrary high 
density concentrations. 

4.1. Notations and basic definitions. We shall start the analysis of equation 
(4.5) by fixing some notation and terminology. First of all, for any /3 £ N d we set 
|/?| = 1 Pit and f° r any function h £ C p (R d x R d ,R), with p > 0 and any j3 £ N d 

such that |/3| < p, we define for every (x, v) £ R d x 


9Sh(x,v) := 


dM h 

d^v\ ■ ■ ■ d^ d Vd 


{x,v), 


with the convention that if /? = (0 ,..., 0) then d)jh(x, v) := h(x, v). 

Definition 4.2 (Test functions). We denote with Ts the set of compactly sup¬ 
ported functions ip from R 2d to R such that for any multi-index f3 £ we have: 

1. if |/3| < 2, then df)jip(x, •) is continuous for every x £ R d ,' 

2. if \P\ = 2, then there exists M > 0 such that: 

(a) d^ip{x,-) is uniformly Holder continuous of order 6 for every x £ R d 
with Holder bound M, that is for every x £ and for every v,w £ R d 
it holds 


|j d%ip(x,v) - d^ip(x,w)\\ < M\\v - -u;|| d ; 

(b) \\d^ip{x,v)\\ < M for every (x,v) £ R 2d . 

Notice that Cf°(R d x R d ;l) C 7j for every 0 < S < 1. 

Let A4o(® 2d ) denote the set of measures taking values in R 2d . For any two 
function / and ip from M 2d to R for which the integral below is well-defined (in 
particular, when / £ A4o(®l 2d ) and <p £ Ts), we set 

(/,¥>) := / V{x,v)f{x,v) dx dv. 

Jm 2d 

Finally, we say that (f,yi,w\, ..., j/jv l > wjv l ) G A4o(®l 2d ) x R 2dNL is admissible if 
the following quantities exist and are finite: 

(*) f R 2 d S(x,v)f(x,v) dx dv, 

(ii) f R 4 d H F (x, v, x, v; 7 n/, y )f(x, v)f(x, v ) dx dv dx dv, 

(Hi) zZif** H L (x, v, yi, we, TT 1 f,y)f(x, v) dx dv, 

(iv) f R2d K F (y k , x)f(x, v) dx dv for every k = 1,..., N L , 
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( V) K L (yk, ye) for every k = 1,..., N L . 

Consequently, we introduce our definition of solution for the combined ODE-PDE 
system for the dynamics of microscopic leaders and mesoscopic followers. 

Definition 4.3. Fix T > 0, 5 > 0, and a : [0,T] — > . By a 6- weak solution 

of the initial value problem for the equation 


dtf + v ■ V x f = A F Q(/, f) + A L Q(/, g), 


(4.6) 


. N h 

y k =w k = / K F (y k ,x)f(x,v) dx dv+ y^K L (y k ,y e ) + u k , 

J ^ d 7^i 


corresponding to the control u and the initial datum (/°, y k ,..., y° N f) € A4q 


in the interval [0,T], we mean any (/, yi ,..., yN L ) £ T 2 ([0, T], A^o(K 2d )) x 
C 1 ([0,T],R dN '') such that: 

1. /( 0,x,v) = f°(x,v) for every ( x,v ) € R 2d ; 

2. the vector {f{t),yi(t),yi(t),...,yN i -(t),ym{t)) is admissible for every t £ 

[o,n- 

3. there exists Rt > 0 such that supp(/(£)) C B Rt { 0) for every t £ [0,T]; 

4■ m F (t) = N F for every t £ [0,T]; 

5. f satisfies the weak form of the equation (4.5), i.e., 

^ (/, <P) + (/,«• V^) = A F <Q(/, f),<p) + A L (Q(f, g), <p ), 
for all t £ (0, T] and all ip £ Ts, where 

(4.7) 

(Q{f, /), tp) = E ( [ (ip(x, v*) - ip(x, v)) f(x, v)f(x, v) dx dv dx dv 

\Jw id 

(4.8) 

{Q{f, 9), V) = E ( / v**) - <p(x, v)) f(x, v)g{x, v) dx dv dx dv) , 


and where v* and v** are given by (4.2) and (4.4), respectively; 
6. for every k = 1,, N L , y k satisfies in the Caratheodory sense 


N L 


Vk=w k = / K F (y k ,x)f(x,v) dx dv+} K L (y k ,y e ) + u k . 


lR 2d 


= 1 


4.2. The grazing interaction limit. In order to obtain a more regular operator 


than the Boltzmann operator in (4.5), we introduce the so called grazing interaction 
limit. This technique, analogous to the so-called grazing collision limit in plasma 
physics, has been thoroughly studied in [55] and allows, as pointed out in [48], to pass 
the binary Boltzmann description introduced in the previous section to the mean-field 
limit. This process, while somehow losing the microscopic interaction rules (4.2)- 
( |4.4[ ), allows a better understanding of the solution behavior for large times, see 
[ 6 [ 150]. Moreover, the regularized operator of Fokker-Planck-type will directly show 
the connection with the microscopic model. 

In what follows, we shall assume that our agents densely populate a small re¬ 
gion but weakly interact with each other. Formally, we assume that the interaction 
strengths if and rf scale according to a parameter e, the interaction frequencies A F 
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and A L scale as 1/e, and we let e —► 0. In order to avoid losing the diffusion term in 
the limit, we also scale the variance of the noise term c 2 as 1/e. More precisely, we 
set 

(4.9) rf = e, if = e, A F = —^ A L = <r 2 = — • 

em F em. e 


We now study the Boltzmann equation (4.51 to see if simplifications may occur under 
the above scaling assumptions. 


Let us fix T > 0, 5 > 0, e > 0, a control u : [0, T\ M dJV , and an initial datum 
(/°, yi, ..., y % L ) £ A4o(K 2d ) x We consider a <5-weak solution (/, j/i,..., Vn l ) 

of system (4.6 1 with control it, initial datum (/°, yj,..., y^ L ) and define g as in (4.1). 
Let the scaling (4.9) hold for the chosen e. Following the ideas in mm , we can 
expand ip(x,v*) inside (4.7) in Taylor’ series of v* — v up to the second order, to get 


\f 

z JR 4d 


(<?(/, /),<£>) = E ( / V v <p(x,v) ■ (v* - v) f(x,v)f(x,v) dx dv dx dv 

\ J R 4d 

where the remainder _R F of the Taylor expansion has the form 

7 p d 

/ ( d v’ 3)l P( x ’ v ) - (v* - v) i (v* - v) • 


V ) 0* - v)i ( V* - v) j 

i,j= 1 


■iff 


^= E 2 


f(x,v)f(x,v) dx dv dx dv 


with v F = d F v* + (1 — d F )v, for some d F £ [0,1]. By using the substitution given by 
the interaction rule ( |4.2| ), i.e. 

v* —v = rf [6{x)C z £ + S(x, v) + m F H F (x, v, x, v; mf, mg )], 


we obtain 


/ V v ip(x,v) ■ (v* — v) f(x, v)f(x, v) dx dv dx dv 
J R 4d 

= V F S7 v ip(x, v) ■ [0(x)C z £ + S{x, v) + m F H F (x, v, x, v; ir 1 f, mg)] 
J R 4d 

f(x,v)f(x,v) dx dv dx dv 

(4.10) = rfnf (/, V v ip • 0C& + i? F m F (/, • (S + U F [f ])), 

having denoted with 

H F [f](x,v) = / H F (x,v,x,v-,n 1 f 1 n 1 g)f(x,v)dxdv. 

J R 2d 


By taking the expected value w.r.t. £ € B of (4.10), the term ifm F {f,V v (p ■ 9C Z £) is 
canceled out, since E (£) = 0. 
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Writing (whenever possible) H F , H h , 8 and S in place of H F (x,v,x,v-,TTif,TTig), 
H h (x , v, x, v\ 7 Tif, tti g), 0{x) and S(x, v), respectively, the same substitution yields for 
the second order term 


IJ 

z J R 4d 


y d v’ j)i p( x ’ v ) ( v * - v )i ( v * - v )j 


i,j =1 


f(x,v)f(x,v) dx dv dx dv 


= \{rf? [ 

z JR 4 


J2 di^ifiix, v) ( 9CJ; + S + m F H F ) i 
i,j =1 


{0C z S + S + m F H F ). 


f(x,v)f(x,v) dx dv dx dv. 


Notice that if we compute the expected value of the expression above, all the cross 
terms j vanish, since they are drawn independently from each other. Hence we 
obtain 

A F (Q(f, /), ip) = \ F rfm F (/, V v <p ■ ( S + H F [f})) + ^A F ( V F f mV </, [8C z f A v <p) 

+ ^A F (r, F ) 2 E F v [fJ}+X F R F v , 

where the notation S F [/, /] stands for 


Af,f} = 


/R 4d 


y, (5 + m F H F ) i {S + m F H F ) j 

i,j =1 


f(x,v)f(x,v) dx dv dx dv. 


By means of the same computations, we can derive a similar expression for ( Q(f , g),tp). 
Indeed, from (4.8) we have 


L d l 

V’ 


A L (Q(f,g),<p) = A L ?? L m L (/, V v ip-H L [g]) + -A L (r? L ) 2 [f,g] + X h R\ 

having denoted with 

'H h [g](x,v)= / H L (x,v,x,v;irif,Tr 1 g)g(x,v)dxdv, 

J R 2d 


[f,g\ = {m L ) 2 [ 

Jr 4 


i,j =1 


d^ip{x,v)H\H\ 


f(x,v)g(x,v) dx dv dx dv , 


^= E 2 


r R 4d 


a 

y [d^ 3) ^p{x,v) - ( v ** - v) i (v** - v)j 


i,j =1 


f(x,v)g(x,v) dx dv dx dv 


with v h = d h v** + (1 - for some d L G [0,1], 
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If we now apply the rescaling rules (4.91, the following simplifications occur 
A Vm F = 1, X h r/ L m L = 1, A F ( rff g 2 m F = a 2 , A L ( f = e, A L ( r/ L f = e. 
Hence, if we let e —> 0 and we assume that, for every <p € 7a, 

^ 111—'L 


(411) limflUS^/,/]) 


J/.fl] I 


- ||i£ 


-iii?: 

e 


VI 


= 0 


holds true, we obtain the weak formulation of a Fokker-Planck-type equation for the 
followers’ dynamics 

(4-12) (f,<p) + (f,v- V x <p) = (f,V v <p-G[f,9] + ^cr 2 {eC z ) 2 A v ip\ , 


where 


g[ f: g] = s + n F [f} + n L [g}. 


We leave the details of the proof of the limit (4.11 


Since ip has compact support, equation (4.12) can be recast in strong form by 


to the following Section 4.3 


means of integration by parts. Coupling the resulting PDE with the microscopic 
ODEs for the leaders k = 1,..., N L , we eventually obtain the system 

dtf + v- W x f = -V„ • (G [/, g] /) + \o 2 (PC z ) 2 A v f, 

r A L 

yk = Wk = / K ¥ (y k ,x)f(x, v) dx dv + 'Y' K h (y k , y e ) + u k . 

t=i 


(4.13) 


Remark 4.4. Observe that, assuming f in (4.13) to be the empirical measure 


/ = Yli —i fi(xi( i),vi(t)) concentrated on the trajectories (xi(t ), Vi(t)) of the microscopic 
dynamics (3.1), we recover the original microscopic model itself. 


Remark 4.5. In terms of model hierarchy one could imaging to compute the 


moments of (4.13) in order to further reduce the complexity. Let us stress that deriving 


a consistent macroscopic system from the kinetic equation is in general a difficult task, 
since equilibrium states are difficult to obtain, therefore no closure of the moments 


equations is possible. For self-organizing models similar to (4.13), in the noiseless 


case (i.e. er = 0), a standard way to obtain a closed hydrodynamic system is to assume 
the velocity distribution to be mono-kinetic, i.e. f(t,x,v ) = p(t,x)6(v — V(t,x)), and 
the fluctuations to be negligible, thus computing the moments of ( |4. 13 1 leads to the 
following macroscopic system for the density p and the bulk velocity V, 


(4.14) 


dtp + V x • (pV) = 0, 

dt(pV) + v* • { P v ® v) = g m [p, p\ v, p l ] P , 


yk 


r N L 

= w k = / K F (y k , x)p(t, x) dx + Y K h (yk, Vi) + u k 
Js. d 


where p L (x,t),V h (x,t) represent the leaders’ macroscopic density and bulk velocity, 
respectively, and Gm the macroscopic interaction operator, see JM Ml for further de¬ 
tails. For cr > 0, the derivation of a macroscopic system depends highly on the scaling 
regime between the noise and the interaction terms, see for example 1 !f . \2(A Fur¬ 
thermore, the presence of diffusion operator in model (4.13) depends on the spatial 


domain, therefore the derivation of a reasonable macroscopic model is not trivial and 
it is left for further studies. 
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4.3. Estimates for the remainder terms. Motivated by the results in [50] . 
we shall estimate the quantity ||i? F || as follows: since v F = d F v* + (1 — d F )v implies 
||t> — u F || < d F ||i> — V*||, then for every p G Ts it follows that 

d!?’ j) ip(x,v) — d!f’^ip(x,v F ) < M\\v — u F || l5 < M (d F ) 5 \\v — i>*|| <5 . 

Hence we get 

- Il-Rlo II < —M (d F ) d E ( [ \\v* — v\\ 2+S f(x,v)f(x,v) dx dv dx dv) 

e 2e Vi R 4d J 

= (i9 F ) S |t 7 f | 2+,s E f f \\6£, + S + m F H F \\ 2+5 f(x,v)f(x,v) dx dv dx dv 

2e Via 4d 

= ( , d F ) 5 |e| 1+<5 E ( f \\9f + S + m F H F \\ 2+s f(x,v)f(x,v) dx dv dx dv\ . 

From the inequality 

||0£ + S + m F H F \\ 2+S < 2 2+2<5 (j|0 £|| 2+5 + ||,S || 2+4 + ||m F H F || 2+l5 ) , 
and since 6 G [ 0 , 1 ], we obtain 

II^H < 2 1+25 M (d F )‘ ) |e| 1+d ( (to f ) 2 E ^||C|| 2+<5 ) +mF /, d W S \\ 2+S f( x ’ v ) dx dv 


+ (m F ) 


2+<5 


|^ F || 2+(5 v)f(x, v) dx dv dx dv J. 


Analogously for I lift, 11 we get 


||f? F || < -M ('d L ) <5 |e | 1+l5 (rn h ) 2+S j \\H L \\ 2+S f(x,v)g(x,v) dx dv dx dv. 

2 i]R 4d 

Similar computations performed on the terms S F [/, /] and Sh [/, g] yield the inequal¬ 


ities 


— II" F 
2 ^ 


£ II—L 
2 ^ 


[/,/]|| < ^Md F (m F [ || S\\ 2 f(x,v) dx dv 

^ v "' R2d 

+ (m F ) 2 f \\H F \\ 2 f(x,v)f(x,v) dx dv dx dv ], 

jR id J 

[f,g] || < ^ Md h (m L f [ \\H F \\ 2 f(x,v)g{x,v) dx dv dx dv. 

* JR. id 


Remembering that solutions of (4.6) have support uniformly bounded in time and 


finite constant mass (i.e., m F (t) = ?n F ( 0 ) < +oo), we have proven the following result. 

Theorem 4.6. Fix T > 0, 8 > 0, u : [0,T] -A R dNL , and let, for every e > 0, 
(f E ,yf, ■ ■ ■ ,y%f) be a S-weak solution of (4.6) corresponding to the control u and the 
initial condition (f°, 2 /?, • - ■, y%i), and where the quantities if, if, A F , A L , and f 2 are 
rescaled w.r.t. e according to (4.9). Suppose that: 
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1. E ^||£|| 2+i5 ) finite, 

2. the functions S, H F and H L are in L^ oc for p = 2,2 + 5. 

Then, as £ —> 0, the solutions (f s , y\,..., y e N i.) converge pointwise, up to a 
quence, to (/, yi ,..., yjv L )> where f satisfies the Fokker-Planck-type equation 
with initial datum (/°, yj, ■ • •, V%.), and for every k = 1,..., N h , y k satisfies 


subse- 

(p2) 


{ Vk = w k = 
Vk{ 0) = Vk- 


R 2d 


jv l 

K F (yk, x)f(x, v) dx dv + 'Y^ K L {yk, ye) + u k , 

e=i 


Remark 4.7. The hypothesis of admissibility in Definition ^ -3\ makes sure that 
all the integrals considered in the proof of Theorem \4-b\ exist and are finite. 

Remark 4.8. The choice of the functions S,H F and H L made in ( |4.3[ ) and (3.31 
clearly satisfy the hypotheses of Theorem \fi6\ fo r any 5 > 0. Hence, we expect the 
simulation of system (4.61 with the scaling (4.91 to be a good approximation of the 
ones of the original microscopic model (3.1) for sufficiently small values of e. 


5. Optimal control of the crowd. In this section we discuss how one can 
(optimally) control the crowd of followers by means of “invisible” leaders, whose 
dynamics are obtained by the minimization of certain cost functionals. It is well 
known that in case of alignment-dominated models the invisibility of leaders is not a 
limitation per se, but nothing is known about more complex models. In particular, 
the coupling between random walk and alignment gives rise to interesting phenomena, 
cf. [53] - Note that, in order to fit real behavior (see also Section [7]), the ratio between 
random walk force and alignment force should be large enough to assure a complete 
exploration of the domain, and small enough to catch the herding behavior. By the 
way, this also narrows the choice of parameters. 


5.1. A preliminary test. To fix the ideas, we plot a typical outcome of the 
microsimulator when the crowd is far away from the exit. At the initial time, agents 
are uniformly distributed in a square, see Fig. |5.1| We refer to this situation as 
Setting 0 (model parameters for this and following simulations are reported in Table 
|5.1[ ). With no leaders, the group splits in several subgroups, each of them having a 
common direction of motion (local consensus is reached in very short time), see Fig. 
|5.1f left. With 5 leaders moving rightward, the crowd reaches immediately a consensus 
and align with leaders, see Fig. |5.1[ center. If, for some reason, leaders cease to be 
influential, the crowd tend to split again, see Fig. |5. If right. 


Table 5.1 
Model parameters. 


Setting 

N L 

N F 

A f 

C F 

a L 

c a 

c z 

C T 

c s 

s 2 

r = C 

7 

0 

0-5 

150 

10 

2 

1.5 

2 

0.25 


1 

0.5 

0.4 

1 

1 

0-3 

150 

10 

2 

1.5 

3 

0.2 

1 

1 

0.5 

0.4 

1 

2 

0-2 

100 

10 

2 

1.5 

3 

0.2 

1 

1 

0.5 

0.4 

1 

3 

0-6 

30-70 

10 

1 

0 

- 

- 

1 

1 

0.5 

0.5 

1 


This makes clear how difficult is to control the crowd in this scenario, since 
leaders have to fight continuously against the natural tendency of the crowd to split 
in subgroups and move randomly, even after that a consensus is reached, cf. m- 
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Fig. 5.1. Setting 0 (no exit is visible here). Microscopic dynamics. The crowd is initially 
confined in the green dashed square. Leaders’ trajectories are in magenta, followers’ are in blue. 
Final positions of followers are in red. Left: no leaders. Center: 5 leaders moving rightward all the 
time. Right: 5 leaders moving rightward, disappearing after a short time. 


A natural question is about the minimum number of leaders needed to lead the 
whole crowd to consensus, i.e. to align all the agents along a desired direction. Numer¬ 
ical simulations suggests that this number strongly depends on the initial conditions. 
Table po~2] gives a rough idea in the case of multiple runs with random initial conditions. 


Table 5.2 

Number of leaders versus percentage of runs where consensus is reached. 


# leaders 

1 

2 

3 

4 

5 

consensus reached 

0 % 

1 % 

27% 

58% 

100 % 


At mesoscopic level we consider the control of the Boltzmann-type dynamics 
(4.6), where we account for small values of the scaling parameter e, in order to be 
sufficiently close to the Fokker-Planck model (4.13). In this case, the control through 
few microscopic leaders has to face the tendencies of the continuous density to spread 
around the domain and to locally align with the surrounding mass. Moreover, their 
action is weakened by the type of interaction considered. Figure [572] shows the results 
corresponding to the kinetic density in the Setting 0. Unlike the microscopic case, no 
splitting occurs where there is no action of the leaders. Instead, the mass smears out 
around the domain, due to the diffusion term. 



Fig. 5.2. Setting 0 (no exit is visible here). Mesoscopic dynamics. The crowd density is initially 
confined in the green dashed square shown in the left figure. Left: no leaders. Center: 5 leaders 
moving rightward all the time. Right: 5 leaders moving rightward, disappearing after a short time. 


5.2. Setting the optimization problem. The functional to be minimized can 
be chosen in several ways. The effectiveness mostly depends on the optimization 
method which is used afterwards. The most natural functional is the evacuation 
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time , 

(5.1) min{i > 0 | Xi(t) ^ Q Mi = 1,..., N F }, 


subject to (3.1) or (4.6) and with u(-) € U a dm, where U a d m is the set of admissible 
controls (including for instance box constraints to avoid excessive velocities). 
Another cost functional, more affordable by standard methods HH5], is 


N F 


N N L 


N L 


(5.2) 


i(x, y, u) = // ii Xi - x t \\ 2 + \\ Xi - y k ii 2 + v E 


Uk || 


*=1 fc=l 


fc =1 


for some positive constants /A, /A, and v. The first term promotes the fact that 
followers have to reach the exit while the second forces leaders to keep contact with the 
crowd. The last term penalizes excessive velocities. This minimization is performed 
at every instant (instantaneous control), or along a fixed time frame [£,, tf] 


(5.3) min / Z(x(t), y(t), u(t)) dt, subject to (|3.1|) or (|4.6|). 

M(-)eG adm J ' - - 


With regards to the mesoscopic scale, both functionals (5.1) and (5.21 can be 


considered, however at the continuous level the presence of few invisible leaders does 
not assure that the whole mass of followers is evacuated. The major difficulty to reach 
a complete evacuation of the continuous density is mainly due to the presence of the 
diffusion term and to the invisible interaction with respect to the leaders. Therefore 
a more appropriate functional is given by the mass evacuated at the final time T, 

(5.4) min < m F (T) = / / f(T,x,v)dxdv\, subject to (|4.6l). 

“(■) eGadm ( J-Rd Jq J 


Remark 5.1. We will not require that leaders themselves reach the exit, but only 
the followers. 


5.2.1. Model predictive control. A computationally efficient way to address 
the optimal control problem ( |5.3[ ) is by means of a relaxed approach known as model 
predictive control (MPC) [io]. We consider a sampling of the dynamics (3.1) at 


every time interval At, and the following minimization problem over N UPC time steps, 
starting from the current time step h , 


(5.5) 


mm 

u{-)eU^dn 


^ + -^VmPC — 1 

E 


/(x(nAt), y(nAt), u(nAt)) 


generating an optimal sequence of controls {u(nAt ),..., u((n + N MFC — l)At)}, from 
which only the first term is taken to evolve the dynamics for a time At, to recast the 
minimization problem over an updated time frame n <r- fi + 1. Note that for N UPC = 2, 
the MPC approach recovers an instantaneous controller, whereas for N MPC = (tf — 
ti)/At it solves the full time frame problem (5.3). Such flexibility is complemented 


with a robust behavior, as the optimization is re-initialized every time step, allowing 
to address perturbations along the optimal trajectory. 
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5.2.2. Modified compass search. When the cost functional is highly irregular 
and the search of local minima is particularly difficult, it could be convenient to move 
towards random methods as compass search (see (8] and references therein), genetic 
algorithms, or particle swarm optimization. In the following we describe a compass 
search method that works surprisingly well for our problem. 

First of all, we consider only piecewise constant trajectories, introducing suitable 
switching times for the leaders’ controls. More precisely, we assume that leaders move 
at constant velocity for a given fixed time interval and when the switching time is 
reached, a new velocity vector is chosen. Therefore, the control variables are the 
velocities at the switching times for each leader. Note that controlling directly the 
velocities rather than the acceleration makes the optimization problem much simpler 
because minimal control variations have an immediate impact on the dynamics. 

Starting from an initial guess, at each iteration the optimization algorithm modi¬ 
fies the current best control strategy found so far by means of small random variations 
of the current values. Then, the cost functional is evaluated. If the variation is ad¬ 
vantageous (the cost decreases), the variation is kept, otherwise it is discarded. The 
method stops when the strategy cannot be improved further. As initial guess for the 
strategy, leaders move at constant speed along the direction joining the target with 
their initial position. 


6. Numerical tests. In what follows we present some numerical tests to validate 
our modeling framework at the microscopic and mesoscopic level. 

The microscopic model (3.1) is discretized by means of the explicit Euler method 
with a time step At = 0.1. The evolution of the kinetic density in (4.13) is approxi¬ 
mated by means of binary interaction algorithms, which approximates the Boltzmann 
dynamics (4.5) with a meshless Monte-Carlo method for small values of the parameter 
e, as presented in @j. We choose e = 0.02, At = 0.01 and a sample of N s = 10000 
particles to reconstruct the kinetic density. This type of approach is inspired by nu¬ 
merical methods for plasma physics and it allows to solve the interaction dynamics 
with a reduced computational cost compared with mesh-based methods, and an ac- 
curacy of 0(N S 7 ). For further details on this class of binary interaction algorithms 
see [U08]. 

Concerning optimization, in the microscopic case we adopt either the compass 
search with functional (5.1) or MPC with functional (5.3). In the mesoscopic case we 
adopt the compass search with functional (5.4). 

We consider three settings for pedestrians, without and with obstacles, hereafter 
referred to Setting 1, 2, and 3, respectively. In Setting 1 we set the compass search 
switching times every 20 time steps, and in Setting 2 every 50, having fixed the 
maximal random variation to 1 for each component of the velocity. In Setting 1, the 
inner optimization block of the MPC procedure is performed via a direct formulation, 
by means of the fmincon routine in MatLab, which solves the optimization problem 
via an SQP method. 


6.1. Setting 1. To begin with, let us consider the case of a large room with no 
obstacles, see Fig. 6.1 The exit is a point located at E = (30,10) which can be reached 
from any direction. We set £ = {x £ R 2 : \x — E\ < 4}. This simple setting helps 
elucidating the role and the interplay of the different terms of our model. Followers are 
initially randomly distributed in the domain [17,29] x [6.5,13.5] with velocity (0,0). 
Leaders, if present, are located to the left of the crowd. Parameters are reported in 
Table EU 
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Fig. 6.1. Setting 1. Left: initial positions of followers (circles) and leaders (squares). Right: 
uniform density of followers and the microscopic leaders (squares). 


6.1.1. Microscopic model. Figure 6.2'first row) shows the evolution of the 
agents computed by the microscopic model, without leaders. Followers having a direct 
view of the exit immediately point towards it, and some group mates close to them 
follow thanks to the alignment force. On the contrary, farthest people split in several 
but cohesive groups with random direction and never reach the exit. 

Figure 6.2 'second row) shows the evolution of the agents with three leaders. The 
leaders’ strategy is defined manually. More precisely, at any time the control is equal 
to the unit vector pointing towards the exit from the current position. Hereafter 
we refer to this strategy as “go-to-target”. Note that the final leaders’ trajectories 
are not straight lines because of the additional repulsion force. As it can be seen, 
the crowd behavior changes completely since, this time, the whole crowd reaches the 
exit. However followers form a heavy congestion around the exit. The shape of the 
congestion is circular: this is in line with the results of other social force models as 
well as physical observation, which report the formation of an “arch” near the exits. 



Fig. 6.2. Setting 1. Microscopic dynamics. First row: no leaders. Second row: three leaders, 
go-to-target strategy. Third row: three leaders, optimal strategy (compass search). 
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The arch is correctly substituted here by a full circle due to the absence of walls. Note 
that the congestion notably delays the evacuation. This suggests that the strategy of 
the leaders is not optimal and can be improved by an optimization method. 

Fig. 6.2'third row) shows the evolution of the agents with three leaders and the 
optimal strategy obtained by the compass search algorithm. Surprisingly enough, the 
optimizator prescribes that leaders divert some pedestrians from the right direction, 
so as not to steer the whole crowd to the exit at the same time. In this way congestion 
is avoided and pedestrian flow through the exit is increased. 

In this test we have also run the MPC optimization, including a box constraint 
Uk(t) e [—1,1]. We choose // = 1, // = 10 -5 , and v = 10 -5 . MPC results are 
consistent in the sense that for N MPC = 2, the algorithm recovers a controlled behavior 
similar to the application of the instantaneous controller (or go-to-target strategy). 
Increasing the time frame up to N MPC = 6 improves both congestion and evacuation 
times, but results still remain non competitive if compared to the whole time frame 
optimization performed with a compass search. 

In Fig. |6.3| we compare the occupancy of the exit’s visibility zone as a function of 
time for go-to-target strategy and optimal strategies (compass search, 2-step, and 6- 
step MPC). We also show the decrease of the value function as a function of attempts 
(compass search) and time (MPC). Evacuation times are compared in Table 6.1 It 
can be seen that only the long-term optimization strategies are efficient, being able 
to moderate congestion and clogging around the exit. 


# Agents inside £ 



Compass search - Value function 
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Fig. 6.3. Setting 1. Optimization of the microscopic dynamics. Top-left: occupancy of the 
exit’s visibility zone E as a function of time for optimal strategy (compass search) and go-to-target 
strategy. Top-right: decrease of the value function (53 as a function of the iterations of the compass 
search (for 50 and 150 followers). Bottom: MPC optimization, occupancy of the exit’s visibility 
zone S as a function of time, CPU time of the optimization call embedded in the MPC solver, and 
the evolution of the corresponding value (2-step and 6-step). 
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Table 6.1 

Setting 1. Evacuation times (time steps). CS=compass search, IG=initial guess. 



no leaders 

go-to-target 

2-MPC 

6 -MPC 

CS (IG) 

N F = 50 

335 

297 

342 

278 

248 (318) 

N F = 150 

OO 

629 

619 

491 

459 (554) 


This suggests a quite unethical but effective evacuation procedure, namely mis¬ 
leading some people to a false target and then leading them back to the right one, 
when exit conditions are safer. Note that in real-life situations, most of the injuries 
are actually caused by overcompression and suffocation rather than urgency. 


6.1.2. Mesoscopic model. We consider here the case of a continuous density 
of followers. Figure 6.4'first row) shows the evolution of the uncontrolled system of 
followers. Due to the diffusion term and the topological alignment, large part of the 
mass spreads around the domain and is not able to reach the target exit. 

In Fig. 6.4 second row) we account the action of three leaders, driven by a go-to- 
target strategy defined as in the microscopic case. It is clear that also in this case the 
action of leaders is able to influence the system and promote the evacuation, but the 
presence of the diffusive term causes the dispersion of part of the continuous density. 
The result is that part of the mass is not able to evacuate, unlike the microscopic 
case. 

In order to improve the go-to-target strategy we rely on the compass search, 
where, differently from the microscopic case, the optimization process accounts the 



Fig. 6.4. Setting 1. Mesoscopic dynamics. First row: no leaders. Second row: three leaders, 
go-to-target strategy. Third row: three leaders, optimal strategy (compass search). 
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objective functional (5.4), i.e., the total mass evacuated at final time. Figure [ 6 ^ third 
row) sketches the optimal strategy found in this way: on one hand, the two external 
leaders go directly towards the exit, evacuating part of the density; on the other hand 
the central leader moves slowly backward, misleading part of the density and only 
later it moves forwards towards the exit. The efficiency of the leaders’ strategy is due 
in particular by the latter movement of the last leader, which is able to gather the 
followers’ density left behind by the others, and to reduce the occupancy of the exit’s 
visibility area by delaying the arrival of part of the mass. 


In Figure [63] we summarize, for the three numerical experiments, the evacuated 
mass and the occupancy of the exit’s visibility area E as functions of time. The 
occupancy of the exit’s visibility area shows clearly the difference between the leaders’ 
action: for the go-to-target strategy, the amount of mass occupying E concentrates 
quickly and the evacuation is partially hindered by the clogging effect, as only 71.3% 
of the total mass is evacuated. The optimal strategy (obtained after 30 iterations) 
is able to better distribute the mass arrival in E, and an higher efficiency is reached, 
evacuating up to 85.2% of the total mass. 



Fig. 6.5. Setting 1. Occupancy of the exit’s visibility zone E, dotted lines, and percentage 
of evacuated mass, star lines, as function of time. Left: histograms for the case without leaders 
(percentage of evacuated mass fl.2%). Center: histograms for leaders moving with the go-to-target 
strategy (percentage of evacuated mass 71.3%). Right: histograms for leaders with an optimal strat¬ 
egy (percentage of evacuated mass 85.2%). 


Remark 6.1. In a realistic context it is impossible to know a priori the initial 
position and velocity of a crowd, but they can be expressed as a joint probability density, 
obtained from statistics on real data. Therefore from the simulation of the kinetic 
model (4.13), one can estimate the evolution of these probabilities and testing the 
efficiency of specific control strategies. 


6.2. Setting 2. In the following we test the microscopic model (with compass 
search optimization) in a more complicated setting, which has also some similarities 
with the one considered in Section [TJ The crowd is initially confined in a rectangular 
room with three walls. In order to evacuate, people must first leave the room and 
then search for the exit point. We assume that walls are not visible, i.e., people can 
perceive them only by physical contact. This corresponds to an evacuation in case of 
null visibility (but for the exit point which is still visible from within E). Walls are 
handled as in [2B| (see also m for an overview of the techniques employed to handle 
obstacles in pedestrian simulations). 

If no leaders are present, the crowd splits in several groups and most of the people 
hit the wall, see Fig. | 6 . 6 | (first row). After some attempts the crowd finds the way out, 
and then it crashes into the right boundary of the domain. Finally, by chance people 
decide, en cascade , to go upward. The crowd leaves the domain in 1162 time steps. 
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If instead we hide in the crowd two leaders who point fast towards the exit (Fig. 
6.6 'second row)), the evacuation from the room is completed in very short time, but 


after that, the influence of the leaders vanishes. Unfortunately, this time people decide 
to go downward after hitting the right boundary, and nobody leaves the domain. 
Slowing down the two leaders helps keeping the leaders’ influence for longer time, 
although it is quite difficult to find a good choice. 

Compass search optimization finds (after 30 iterations) a nice strategy for the 
two leaders which remarkably improves the evacuation time, see Fig. 6.6 third row). 
One leader behaves similarly to the previous case, while the other diverts the crowd 
pointing SE, then comes back to wait for the crowd, and finally points NE towards 
the exit. This strategy allows to bring everyone to the exit in 549 time steps, without 
bumping anyone against the boundary, and avoiding congestion near the exit. 



Fig. 6.6. Setting 2. Microscopic simulation. First row: no leaders (video). Second row: two 
leaders and go-to-target strategy (video). Third row: two leaders and best strategy computed by the 
compass search (video). 


6.3. Setting 3. In the following test we propose a different use of the leaders. 
Rather than steering the mass towards the exit, they can be employed to fluidify 
the evacuation near a door. In fact, it is observed that when several pedestrians 
reach a bottleneck at the same time they slow down each other, possibly coming to a 
deadlock. As a consequence, none of them can pass through the bottleneck. We want 
to investigate the possibility of using leaders as small smart obstacles 1 which alleviate 
the high friction among the bodies just moving randomly near the exit. 

We consider a large room with a single exit door located on the right wall, visible 
from any point (E = R 2 ). We nullify the repulsion force perceived by the leaders, see 
Table |5.1[ Most important, in this test we assume that followers have nonzero size 
and hard collisions are not possible. This is achieved assuming that they are circular¬ 
shaped with diameter 0.25, and keeping frozen those pedestrians who aim at moving 
too close to another pedestrian in one time step. The width of the exit is 0.45. Leaders 
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Fig. 6.7. Setting 3 (exit is visible from any point). Left: evacuation with no leaders. Center: 
evacuation with 3 leaders. Riqht: percentage of evacuation failures as a function of the number of 
leaders, for N F =30, 50, 70. 


stay near the exit at any time and move with a random velocity, each component being 
uniformly distributed in [—1.3,1.3]. We say that the evacuation succeeded if N F — 10 
followers leave the room within 2000 time steps, otherwise we say that the evacuation 
failed. Average quantities are computed running the simulation 200 times. 

Fig. 6.7 shows a typical outcome without (left) and with (center) leaders. Fig. 


6.7 -right shows the percentage of evacuation failures as a function of the number of 
leaders. It is clearly visible that in this scenario the optimal number of leaders is 
3, and this number is independent from the number of evacuees. This is expected 
since leaders act at local level near the exit and do not affect people queuing far 
from the exit. We noted that the leaders initially act as a barrier for the incoming 
pedestrians and slow down the evacuation. Later on, instead, they are quite efficient 
in destroy symmetries and avoid deadlocks. As a by-product, we confirmed once again 
the Braess’s paradox, which states that reducing the freedom of choice (e.g., adding 
obstacles in front of the exit [30] [52]) one can actually improve the overall dynamics. 


7. Validation of the proposed crowd control technique. The crowd control 
technique investigated in the previous sections relies on the fact that pedestrians 
actually exhibit herding behavior in special situations, so that the alignment term 
in the model is meaningful. The herding behavior is investigated since long time, 
especially in the psychological literature. To further confirm this behavior in our 
scenario, we have organized an experiment involving volunteer pedestrians. 

The experiment took place on October 1, 2014 at Department of Mathematics 
of Sapienza - University of Rome, Rome (Italy), at 1 p.m. Participants were chosen 
among first-year students. Since courses started just two days before the date of the 
experiment and the Department has a complex ring-shaped structure, we could safely 
assume that most of the students were unfamiliar with the environment. 

The experiment. Students were informed to be part of a scientific experiment 
aiming at studying the behavior of a group of people moving in a partially unknown 
environment. Their task was to leave the classroom II and reach a certain target inside 
the Department. Students were asked to accomplish their task as fast as possible, 
without running and without speaking with others. Participation was completely 
volunteer and not remunerated (only a celebratory T-shirt was given at later time). 

76 students (39 girls, 37 boys) agreed to participate in the experiment. The 
students were then divided randomly in two groups: group A (42 people) and group 
B (34 people). The two groups performed the experiment independently one after 
the other. The target was communicated to the students just before the beginning of 
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the experiment. It was the Istituto Nazionale di Alta Matematica (INdAM), which 
is located just upstairs w.r.t. the classroom II. Due to the complex shape of the 
environment, there are many paths joining classroom II and INdAM. The shortest 
path requires people to leave the classroom, to go leftward in a little frequented and 
unfamiliar area (even for experienced students), and climb the stairs. The target can 
be also reached by going rightward and climb other, more frequented, stairs. 

In group B there were 5 incognito students, hereafter referred to as leaders, of 
the same age of the others. Leaders were previously informed about the goal of the 
experiment and the location of the target. They were also trained in order to steer 
the crowd toward the target in minimal time. Nobody recognized them as “special” 
before or during the experiment. Unexpectedly, also in group A there was a girl who 
knew the target. Therefore, she acted as an unaware leader. 

It is important to stress that all the other students continued their usual activities 
and participants were not officially recorded by the organizers. This choice was crucial 
to get natural behavior and meaningful results, cf. [251 Sect. 3.4.3]. The price to 
pay is that we had to extract participants’ trajectories by low-resolution videos taken 
by two observers. Moreover, the area just outside classroom II was rather crowded, 
introducing a high level of noise in the experiment. Finally, let us mention that 
some participants broke the rules of the experiment speaking with others and running 
(confirming the natural behavior...). Our experiment can be compared with those 
in [33]! although in those experiments group cohesion was imposed as a rule to the 
participants. 

Results. In Figure 7.1 we show the history of left/right decisions taken by the 
students just outside the two doors of the classroom. Group A used both exits of the 


DOOR 2 


DOOR 1 


DOOR 1 


left 

right 

1 1 11 

^ informed (unaware) 

left 

1 

1 111111111 11111 1 

right 

1 MINI INI 

II 


10 


15 20 25 

time (s) 

^informed (aware) 


30 


running 


35 


40 


10 15 20 25 30 35 40 

time (s) 


Fig. 7.1. Time instants corresponding to the students’ right/left decisions in front of the doors 
for group A (top) and B (bottom). 


classroom. The 8-people group which used door 2 was uncertain about the direction 
for a while. Then, once the first two students decided to go rightward, the others 
decided simultaneously to follow them. Similarly, the first student leaving from door 
1 was uncertain for a few seconds, then he moved rightward and triggered a clear 
domino effect. At t = 20 s the unaware leader moved to the left, inducing hesitation 
and mixed behavior in the followers. After that, another domino effect arose. 
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Group B used only door 1. Invisible leaders were able to trigger a domino effect 
but this time 4 people decided unilaterally not to follow them, although they were not 
informed about the destination. At t = 20 s, after the passage of the last leader, a girl 
passed through the door and went leftward. Then, at t = 22 s, she suddenly began 
to run toward the right. She first induced hesitation, then triggered a new rightward 
domino effect. 

Discussion. Students have shown a tendency to go rightward, being that part of 
the Department more familiar, but this tendency was greatly overcame by the wish to 
follow the group mates in front, regardless of their right/left preferences. Also, after 
the experiment, the students admitted that they have been influenced by the leaders 
because of their clear direction of motion. Interestingly, no more than 5 people have 
been influenced by a single leader. This is compatible with the topological alignment 
term used in our model. The small value of A f here is due to the fact that the space 
in front of the doors was rather crowded and the visibility was reduced. 

It is also interesting to note that only 4 students reached the target alone. This 
confirms the tendency not to remain isolated and to form clusters. Video recordings 
also show that (bad or well) informed people behaved in a rather recognizable manner, 
walking faster, overtaking the others and exhibiting a clear direction of motion, cf. 
[54] . No follower has overtaken other people. 

The 5 incognito leaders adopted a good but non optimal strategy, being too close 
to each other, too much concentrated in the front line, and too fast (cf. Fig. 6.6 second 
row)). This was due to the fact that organizers expected a clearer environment in 
front of the doors and consequently a faster coming out of the people. A better 
distribution in the crowd would lead to better results, although nothing can be done 
against a quite noticeable bad informed person. 


Conclusions and future work. Both virtual and real experiments suggest that 
a bottom-up control technique for crowds like the one proposed in the paper is actually 
feasible and surely deserves further attention. Numerical simulations can give impor¬ 
tant insights about optimal strategies for leaders, which can be then made ethically 
acceptable and tested in real situations. The main open issue regards the minimal 
number of leader needed to reach the desired goal. For that, further theoretical in¬ 
vestigation as well as real experiments with a larger number of participants and a 
clearer environment are needed. The results obtained in Setting 3 should be also 
experimentally confirmed. 
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